GT relation aware attn #599

Draft

yliu2-sc wants to merge 8 commits into main from yliu2/gt_relation_aware_attn

Conversation

@yliu2-sc
Collaborator

Scope of work done

Where is the documentation for this feature?: N/A

Did you add automated tests or write a test plan?

Updated Changelog.md? NO

Ready for code review?: NO

self._relation_attention_matrices: Optional[nn.Parameter] = None
if relation_attention_mode == "edge_type_additive":
    self._relation_attention_matrices = nn.Parameter(
        torch.empty(num_relations, num_heads, self._head_dim, self._head_dim)
    )
Collaborator


If we look at HGT's equation 3, the W^(ATT)_{\phi(e)} doesn't have a superscript i, so I think the edge-type-specific transformation is per edge type, not per (edge type, head index). The current implementation has more capacity, but we are not doing an apples-to-apples comparison with HGT.
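To illustrate the difference, here is a minimal sketch (names and sizes are illustrative, not from this PR): the HGT-style parameter is one d×d matrix per edge type shared across heads, while the current implementation keeps one matrix per (edge type, head) pair. The shared matrix can still be broadcast across the head dimension when applied.

```python
import torch
import torch.nn as nn

num_relations, num_heads, head_dim = 4, 8, 16

# HGT eq. 3 style: one matrix per edge type, shared across heads -> (R, d, d)
w_att_hgt = nn.Parameter(torch.empty(num_relations, head_dim, head_dim))
nn.init.xavier_uniform_(w_att_hgt)

# Current implementation: per (edge type, head) -> (R, H, d, d)
w_att_ours = nn.Parameter(torch.empty(num_relations, num_heads, head_dim, head_dim))
nn.init.xavier_uniform_(w_att_ours)

# The shared variant broadcasts over the head axis when transforming keys
# of shape (R, H, n, d):
keys = torch.randn(num_relations, num_heads, 5, head_dim)
transformed = keys @ w_att_hgt[:, None, :, :]  # still (R, H, n, d)
```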

Collaborator


Also, if we compare with HGT's equation 3, it seems we are missing the \mu attention multiplier per (source_type, edge_type, destination_type) triple.
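For concreteness, a hypothetical sketch of what that \mu prior looks like: a learnable per-head scalar indexed by the meta-relation triple, multiplied onto the attention logits. Names and shapes here are illustrative assumptions, not code from this PR.

```python
import torch
import torch.nn as nn

num_meta_relations, num_heads = 6, 8

# One learnable scalar per (meta-relation, head); HGT initializes mu to ones.
mu = nn.Parameter(torch.ones(num_meta_relations, num_heads))

logits = torch.randn(num_heads, 3, 3)   # per-head attention logits
meta_relation_id = 2                    # index of <phi(s), psi(e), phi(t)>

# Scale the logits by the prior for this meta relation (broadcast over keys/queries).
scaled = logits * mu[meta_relation_id][:, None, None]
```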

pairwise_relation_mask=pairwise_relation_mask,
)
if relation_attention_bias is not None:
attn_bias = (
Collaborator


HGT uses KWQ^T, whereas we use KQ^T + KWQ^T. So basically we reparametrized HGT's formula with W'^ATT = I + W^ATT. I think we have the same expressiveness, but maybe we should initialize our W^ATT differently: zero init or a small-sigma gaussian init could be options, as Xavier could make the variance a bit too big for a bias term. I'm open to discussion.
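A small sketch of the two inits under discussion (assuming the reparametrization above): with zero init, the relation term K W Q^T vanishes at step 0, so the reparametrized score K(I + W)Q^T reduces exactly to the plain KQ^T attention; a small-sigma gaussian keeps W a small perturbation of the identity instead.

```python
import torch
import torch.nn as nn

head_dim = 16

# Option 1: zero init -> relation term is a no-op at initialization.
w_att = nn.Parameter(torch.zeros(head_dim, head_dim))

# Option 2: small-sigma gaussian init (sigma chosen illustratively).
w_att_gauss = nn.Parameter(torch.empty(head_dim, head_dim))
nn.init.normal_(w_att_gauss, mean=0.0, std=0.02)

q = torch.randn(3, head_dim)
k = torch.randn(3, head_dim)

# K Q^T + K W Q^T == K (I + W) Q^T; with zero init this equals plain K Q^T.
logits = k @ q.T + (k @ w_att) @ q.T
```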

active_relation_ids = torch.unique(active_relation_positions[:, 3], sorted=True)

for relation_idx_tensor in active_relation_ids:
relation_idx = int(relation_idx_tensor.item())
Collaborator


This .item() call and the torch.unique(active_relation_positions[:, 3], sorted=True) above both cause a GPU sync. Some AI suggestions to avoid it for performance:

# remove the torch.unique line above
for relation_idx in range(self._num_relations):
    # remove the .item() line
    relation_positions = active_relation_positions[
        active_relation_positions[:, 3] == relation_idx
    ]
    ...
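A runnable toy version of the suggestion above (the tensor values are made up for illustration): iterating over the host-known range of relation ids keeps the boolean-mask selection entirely on device, with no unique()/.item() device-to-host sync. Note the selection may yield zero rows for relations absent from the batch, so downstream ops must tolerate empty tensors.

```python
import torch

num_relations = 3  # assumed known on the host (e.g. self._num_relations)

# Toy stand-in for active_relation_positions; column 3 holds the relation id.
active_relation_positions = torch.tensor(
    [[0, 0, 0, 2],
     [1, 0, 0, 0],
     [2, 0, 0, 2]]
)

for relation_idx in range(num_relations):
    # Pure tensor comparison against a Python int: no GPU sync needed.
    relation_positions = active_relation_positions[
        active_relation_positions[:, 3] == relation_idx
    ]
    # relation_positions may be empty for relations not present in the batch.
```

The trade-off is that the loop now runs num_relations iterations even when few relations are active, which is usually cheap relative to a forced sync per step.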
